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architectures compared with other convolutional neural networks (CNN) architectures in 
OP-CNN order to validate its capability. The strawberry dataset was employed to train 
Plant diseases detection and test the models since the strawberry is one of the main crops in the 
Strawberry plant Larache Province (Morocco). The experimental tests demonstrate that our 


proposed OP-CNN reaches the highest values versus DenseNet121, VGG19, 
and ResNet50 with 100%, 99%, 97%, and 63% respectively for 
classification accuracy, 100%, 100%, 98% and, 79% respectively for 
precision, 100%, 99%, 97%, and 63% respectively for recall, and 100%, 
99%, 97%, and 58% respectively for "F" _1Score. 


This is an open access article under the CC BY-SA license. 


Corresponding Author: 


Sara Belattar 

Department of Computer Science, Faculty of Polydisciplinary, Abdelmalek Essaadi University 
B.P 133, Main Post 92150, Ksar-el-kebir, Morocco 

Email: sarah.belattar12 @ gmail.com 


1. INTRODUCTION 

The agriculture domain serves as a significant sector in developing countries such as Morocco, and 
it contributes an important gross domestic product (GDP) percentage to their national revenue. Due to the 
weather conditions and the biotic factors, agricultural productivity is decreasing day by day. Climate change 
and other factors lead to diseases in the plant, which have an impact on livestock and human health. For that, 
plant diseases are a major source of concern, and they limit the increase in agricultural productivity. In 
Morocco, most farmers use traditional techniques to detect diseases in plant crops by inspecting the quality of 
their plants daily. This issue increases the number of labourers and decreases the crop productivity. Recently, 
artificial intelligence (AI) tools represent a good and appropriate solution to avoid the traditional techniques 
used in the agricultural sector by offering the intelligence proprieties such as learning, self-adapting, 
classification capability, and more [1]-[5]. These benefits enable us to create intelligent agricultural systems 
that will help farmers in improving their agricultural productivity, which is our study's goal. In this regard, 
many approaches and procedures for identifying and diagnosing plant diseases have been developed and 
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tested by researchers. As per [6], the authors have proposed an intelligent diagnosis system for plant diseases 
named PD?SE-Net to identify various plant diseases and estimate their severity. For training different plant 
images, they are used ResNet-50 architecture as the basic model. In Tiwari et al. [7], the authors proposed the 
dense convolutional neural networks (Dense-Net) to detect plant diseases and classification using leaf 
images. In Abas et al. [8], the authors have discussed the prospect of using VGG 16 architecture for plant 
classification. In Wang et al. [9], a trilinear convolutional neural networks model (T-CNN) was used to 
identify and detect the plant diseases. In Kumar et al. [10], the authors suggested the exponential spider 
monkey optimization (ESMO) for plant diseases identification. In Neelakantan [11], various machine 
learning algorithms were developed in order to find the best technique for classifying plant diseases. As 
consequence, plant diseases diagnosis is a very challenging topic in precision agriculture. 

As seen from previous works cited above, the deep neural network architectures have been widely 
applied in the diagnosis and the detection of plant diseases with significant results. However, the use of other 
new convolutional neural networks (CNN) architecture is still required to reach high performance with fewer 
parameters. This paper proposes the convolutional neural networks architecture versus other algorithms 
founded in plant diseases detection and diagnosis due to their capabilities in identifying the diseases of plants 
and their ability to extract the features of the image, which achieves high accuracy in comparison with 
machine learning algorithms [12]. In this study, we built our model using CNN (OP-CNN) for strawberry 
plant diseases diagnosis. The strawberry plant is among the main crop of the Larache Province (northern 
Morocco). The performance of the proposed architecture is compared with the most commonly used CNN 
architectures in various works, such as DenseNet 121, VGG 19, and ResNet 50. 

The rest of this paper is organized as follows: section 2 presents and describes our proposed OP- 
CNN architecture. Section 3 describes all methods used for plant diseases diagnosis. Section 4 provides the 
result and discussion of the performance between our proposed architecture and other CNN architectures. 
Finally, the conclusion of the work and perspectives were highlighted in section 5. 


2. THE PROPOSED OP-CNN ARCHITECTURE 

In this article, we aim to create our architecture (OP-CNN) to diagnose strawberry plant diseases for 
helping the farmers to make the right decisions. The proposed OP-CNN will be evaluated by comparing it to 
other CNN architectures in terms of the performance. At the beginning of the proposed OP-CNN training 
procedure as shown in Figure 1, we preprocessed our dataset, and then we applied the data augmentation 
technique to increase the number of plant images. 
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Figure 1. The proposed OP-CNN architecture 


The proposed OP-CNN consists of six convolutional layers, four Maxpooling layers, and two fully 
connected layers. During the training phase, our OP-CNN receives a fixed-size (224x224) image as input, 
which is then passed through a convolutional layer, where we use the 32 filters with small size: (3x3). This 
convolution produces 32 features maps with dimensions of (222x222). The features maps obtained will be 
entered into a second convolutional layer with 64 filters that is (3x3) in size, and the features maps of the 
second convolutional layer will be transmitted to the third convolutional layer with 64 filters. Then, we 
included the max-pooling to reduce the image size, parameters, and calculation. Max-pooling is carried out 
over a (2x2) size. Following that, we added three convolutional layers, where we use 128 filters with size: 
(3x3). In addition, we add the max-pooling after each convolutional layer. 

In order to complete our proposed architecture, we add two fully connected layers after a stack of 
convolutional layers series. The first Fully connected layer contained 64 neurons and the second is the 
sigmoid layer because we have the binary classification problem. The architecture of the proposed OP-CNN 
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is shown in Figure |. In our proposed architecture, we use the activation function (ReLu, i.e, rectified linear 
unit), dropout (for avoiding the overfitting problem), and sigmoid function for binary classification (In our 
case, we have two classes i.e., healthy and diseased strawberry) (see Figure 1). 


3. RESEARCH METHOD 
3.1. Data pre-processing 

The OP-CNN model we propose, as well as other CNN architectures, is divided into two phases. 
The first is data pre-processing and augmentation. The second is plant disease detection and diagnosis using 
the proposed models. This study relies on data from the Kaggle since our Larache Province (northern 
Morocco) currently lacks strawberry-specific data. The strawberry dataset contained two classes (healthy 
strawberry and diseased strawberry that is affected by leaf scorch disease). The dataset used represents 2000 
images in totality. For pre-processing step, we divided the dataset randomly into 80% for the training phase 
and 20% for the testing phase using the Spyder platform. 


3.2. Data augmentation 

Data augmentation (DA) is a technique used in deep learning to generate new images from the 
original dataset by random transformation. The goal of the DA technique is to solve the problem of few 
images in the training phase. According to this issue, the network can give unsatisfactory results. For that, the 
data augmentation adds the noise in the dataset to improve the generalizability of the model's results when 
confronted with new images. Furthermore, the data augmentation technique can overcome the overfitting 
issue [13], [14]. 


3.3. Convolutional neural networks CNN 

Artificial intelligence (AI) has been widely applied in various areas, and it has emerged as a critical 
tool for the advancement of the world [15]. In the agriculture field, AI helps the farmers to use the 
intelligence systems and technologies thanks to their intelligence features compared to traditional techniques 
[16]-[18]. In this paper, we propose convolutional neural networks (CNN). The CNN is one of many 
Artificial Intelligence methods; more precisely is a type of the Neural Network employed in the computer 
vision domain [19], [20]. The main success of CNN is its ability to learn and classify large image datasets 
with great results [21] versus traditional neural networks and machine learning algorithms that increase the 
level of plant diseases detection [12]. 


3.4. CNN architectures 

There are several architectures for CNN. For that purpose, the proposed OP-CNN model is 
compared in this study to the most commonly used architectures in plant disease diagnosis (or identification), 
such as DenseNet, VGGNet, and ResNet. These architectures are pre-trained and based on transfer learning. 

—  DenseNet is a dense convolutional neural network created and suggested by Tsinghua and Cornwell 
University and Facebook AI Research [22]. In a feed-forward manner, DenseNet connects each layer to 
the other layers. Among the advantage of DenseNet architecture is the improvement of feature 
propagation and minimizing the number of parameters. The DenseNet combines the input layer, 
convolutional layers (Conv) series, global average pooling, Fully Connected layers, and so on. For our 
research, we used DenseNet 121, which is a type of DenseNet. 

— VGGNet is a visual geometry group net suggested by Simonyan and Zisserman [23] and was got second 
place in the 2014 competition. The VGGNet was trained previously on the ImageNet dataset. This 
architecture has a good power to extract the features image and consists of the Input layer, 
convolutional layers series, maxpooling, and fully connected layers. This paper proposes VGG 19, a 
VGGNet variant with 19 deep layers [24]. 

—  ResNet is a residual neural network introduced in the 2015 competition and was a winner of this 
competition by giving significant results in classifying the ImageNet dataset with the minimum error. 
The ResNet model offers different benefits, such as reducing the difficulty of training deep neural 
networks and optimizing the parameters [25]. The ResNet structure contains the input layer, 
convolutional layers series (Conv), batch normalization (BN), global mean pooling (GMP), and Fully 
Connected layers. In our work, we propose ResNet 50 type with 50 deep layers. 


4. RESULTS AND DISCUSSION 
The proposed OP-CNN and other CNN architectures were trained using a strawberry plant dataset, 
as illustrated in Figure 2, where Figure 2(a) shows healthy strawberries and Figure 2(b) shows diseased 
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strawberries (i.e., strawberries leaf scorch). The strawberry plant dataset was divided into train and test sets 
as shown in Table 1, and the performance of the OP-CNN and other CNN architectures was evaluated using 
the testing set. Thus, the different validation metrics used for examining the overall effectiveness and the 
quality of the prediction models (i.e., classification accuracy, precision, recall, F,Score, and confusion 
matrix) are described in (1)-(4) and Figure 3. 


(b) 


Figure 2. Images from our used strawberry plant dataset (a) healthy strawberry and (b) strawberry leaf scorch 


Table 1. Original strawberry plant dataset division into train and test sets 


Class/total Training (80%) Test (20%) Total (100%) 
Healthy strawberry 800 200 1,000 
Diseased strawberry (leaf scorch) 800 200 1,000 
Total 1,600 400 2,000 
np P TP+TN 
Classification accuracy = = - (1) 
Total number of plant images 
Precision (Pre) = = (2) 
~~ TP+FP 
TP 
Recall (Rec) = eT (3) 
_ (Pre x Rec) 
F, Score (F,) = 2 x ror ReO (4) 


Classification accuracy is the percentage of the sum of true positive and true negative on the total 
number of plant images, as expressed in (1). The precision metric indicates the number of correct predictions 
made by the model, as expressed in (2). The recall and F,Score metrics represent the predictive model's 
effectiveness, as expressed in (3) and (4). 

Before transferring the models to the training step, we used the data augmentation criteria (e.g., 
shear range, zoom range, horizontal flip, and so on) to increase the number of strawberry plant images and 
improve the accuracy performance of the models. Furthermore, all models have the same image size as input 
(224x224), optimization method (Adam), and same size of selected batch (32). The proposed OP-CNN and 
other CNN architectures were compiled with GPU support and realized with the Keras framework that 
represents an open-source library of deep neural networks using Python language. 


4.1. The experimental tests 

Figure 3 depicts the confusion matrix for all models used. According to Figure 3(a), the proposed 
OP-CNN produces more excellent performance than other state-of-the-art CNN architectures, indicating that 
the OP-CNN classified correctly all strawberry plant images with 0 misclassifications. Strawberry plants 
were nearly correctly classified using DenseNet 121 and VGG 19 with 2 and 10 respectively for 
misclassifications, as shown in Figure 3(b) and Figure 3(c). In comparison, ResNet 50 has 147 
misclassifications, resulting in poor performance, as shown in Figure 3(d). 

As seen in Table 2, the proposed architecture and other CNN architectures obtain nearly similar 
ratios of precision, recall, and F,Score, except ResNet 50, which provide unsatisfying performance. 
Moreover, the proposed OP-CNN offer best performance for all classes during ten epochs versus 30 epochs 
for other CNN architectures. As a result, the proposed OP-CNN significant improves the performance 
outcomes with fewer epochs. 

From Table 3, it can be noticed that the OP-CNN model gives the best average (100%) in all 
validation metrics versus CNN architectures. Furthermore, Figure 4 shows that the proposed OP-CNN 
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architecture reaches the highest value against DenseNet 121, VGG 19, and ResNet 50 in classification with 
100%, 99%, 97%, and 63%, respectively for accuracy. Figure 4 illustrates the classification accuracy 
achieved by the proposed OP-CNN and other CNN architectures in the test set. 


Healthy Strawberry 200 0 
Strawberry Leaf 2 198 
Scorch 
Healthy Strawberry Healthy Strawberry 
Strawberry Leaf Scorch Strawberry Leaf Scorch 
(a) (b) 
Healthy Strawberry 200 0 200 oO 
Strawberry Leaf 10 190 147 53 
Scorch 
Healthy Strawberry Healthy Strawberry 
Strawberry Leaf Scorch Strawberry Leaf Scorch 
(c) (d) 


Figure 3. The confusion matrix comparison between (a) the proposed OP-CNN, (b) DenseNet 121, 
(c) VGG 19, and (d) ResNet 50 for each class (healthy strawberry and diseased strawberry 
(i.e., strawberry leaf scorch)) 


Table 2. The performance of the proposed OP-CNN vs. DenseNet 121, VGG 19, and ResNet 50 using a test 
set for each class 


CNN architectures Classes Precision Recall F, Score Epoch Numbers 
and OP-CNN (%) (%) (%) 

DenseNet 121 Healthy Strawberry 99% 100% 100% 30 
Diseased Strawberry 100% 99% 99% 

VGG 19 Healthy Strawberry 95% 100% 98% 30 
Diseased Strawberry 100% 95% 97% 

ResNet 50 Healthy Strawberry 58% 100% 73% 30 
Diseased Strawberry 100% 27% 42% 

OP-CNN Healthy Strawberry 100% 100% 100% 10 
Diseased Strawberry 100% 100% 100% 


Table 3. Average results of the proposed OP-CNN and CNN architectures on the test set 
CNN architectures and OP-CNN Average precision Average Recall Average F; Score 


(%) (%) (%) 
DenseNet 121 100% 99% 99% 
VGG 19 98% 97% 97% 
ResNet 50 79% 63% 58% 
OP-CNN 100% 100% 100% 


From the results obtained, we can conclude that the proposed OP-CNN outperforms CNN 
architectures, in particular ResNet 50 (i.e., the classification accuracy of 100% for OP-CNN and 63% for 
ResNet 50). The best improvement of the proposed OP-CNN architecture in the classification accuracy was 
1% than DenseNet 121, 3% than VGG 19, and 37% than ResNet 50 on average (see Figure 4). The ultimate 
objective of this study is to propose another efficient architecture to diagnose and detect the diseases of the 
strawberry plant, which is among the problem found in the Larache Province (northern Morocco). 
Additionally, the proposed OP-CNN was compared to other state-of-the-art CNN architectures in order to 
evaluate its performance. 

The early detection of plant images represents a challenging task in agriculture precision. For that 
reason, our proposed model is highly accurate and appears to be promising for improving agricultural 
productivity through the early detection of strawberry plant diseases, which provide farmers to take adequate 
solutions. Therefore, the proposed OP-CNN can replace the traditional technique used for farmers with high- 
quality diagnosis and classification. Besides that, the proposed architecture could decrease the number of 
labourers and increase agricultural productivity. 
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Figure 4. The classification accuracy comparison between the proposed OP-CNN and other CNN architectures 
on the test set 


5. CONCLUSION 

Plant disease diagnosis and identification is a very important task. In recent years, Morocco has 
experienced declining agricultural productivity in most of its regions due to climate change and other factors 
such as biotic factors. Strawberry is a major crop in the Larache Province (northern Morocco). Therefore, in 
this work, we proposed the OP-CNN architecture to diagnose and detect strawberry plant diseases to assist 
farmers in making the best decision. To validate the performance of the proposed architecture, we compared 
it to other CNN architectures in terms of classification accuracy, precision, recall, and F,score. The 
experimental tests demonstrate that the proposed OP-CNN is capable to detect the plant diseases of the 
strawberries, and it significantly improves the diagnosis task. Thus, the main contributions of this article are 
given as follows: a) this work contributes by proposing another accurate OP-CNN architecture to diagnose 
and detect strawberry plant diseases; b) the proposed OP-CNN achieved an excellent performance compared 
to DenseNet 121 VGG 19, and ResNet 50 with 100%, 99%, 97%, and 63% respectively for classification 
accuracy; c) the proposed OP-CNN reached better average results versus DenseNet 121, VGG 19, and 
ResNet 50 with 100%, 100%, 98% and, 79% respectively for precision, 100%, 99%, 97%, and 63% 
respectively for recall, and 100%, 99%, 97%, and 58% respectively for F4 Score. 

In the future article, we aim to use the real dataset from Larache Province (northern Morocco) for 
testing and validating various CNN architectures and the proposed architecture. Additionally, we will 
improve the ResNet 50 architecture in accuracy terms. These objectives help the farmers of this province to 
increase their crop productivity by limiting yield losses. 
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